Bayes estimators for phylogenetic reconstruction
نویسندگان
چکیده
Tree reconstruction methods are often judged by their accuracy, measured by how close they get to the true tree. Yet, most reconstruction methods like maximum likelihood (ML) do not explicitly maximize this accuracy. To address this problem, we propose a Bayesian solution. Given tree samples, we propose finding the tree estimate that is closest on average to the samples. This "median" tree is known as the Bayes estimator (BE). The BE literally maximizes posterior expected accuracy, measured in terms of closeness (distance) to the true tree. We discuss a unified framework of BE trees, focusing especially on tree distances that are expressible as squared euclidean distances. Notable examples include Robinson-Foulds (RF) distance, quartet distance, and squared path difference. Using both simulated and real data, we show that BEs can be estimated in practice by hill-climbing. In our simulation, we find that BEs tend to be closer to the true tree, compared with ML and neighbor joining. In particular, the BE under squared path difference tends to perform well in terms of both path difference and RF distances.
منابع مشابه
Empirical Bayes Estimators with Uncertainty Measures for NEF-QVF Populations
The paper proposes empirical Bayes (EB) estimators for simultaneous estimation of means in the natural exponential family (NEF) with quadratic variance functions (QVF) models. Morris (1982, 1983a) characterized the NEF-QVF distributions which include among others the binomial, Poisson and normal distributions. In addition to the EB estimators, we provide approximations to the MSE’s of t...
متن کاملEstimation and Reconstruction Based on Left Censored Data from Pareto Model
In this paper, based on a left censored data from the twoparameter Pareto distribution, maximum likelihood and Bayes estimators for the two unknown parameters are obtained. The problem of reconstruction of the past failure times, either point or interval, in the left-censored set-up, is also considered from Bayesian and non-Bayesian approaches. Two numerical examples and a Monte Carlo simulatio...
متن کاملClassic and Bayes Shrinkage Estimation in Rayleigh Distribution Using a Point Guess Based on Censored Data
Introduction In classical methods of statistics, the parameter of interest is estimated based on a random sample using natural estimators such as maximum likelihood or unbiased estimators (sample information). In practice, the researcher has a prior information about the parameter in the form of a point guess value. Information in the guess value is called as nonsample information. Thomp...
متن کاملLimiting Properties of Empirical Bayes Estimators in a Two-Factor Experiment under Inverse Gaussian Model
The empirical Bayes estimators of treatment effects in a factorial experiment were derived and their asymptotic properties were explored. It was shown that they were asymptotically optimal and the estimator of the scale parameter had a limiting gamma distribution while the estimators of the factor effects had a limiting multivariate normal distribution. A Bootstrap analysis was performed to ill...
متن کاملChoice of topology estimators in Bayesian phylogenetic analysis.
Wheeler WC and Pickett KM (2008. Topology-Bayes versus clade-Bayes in phylogenetic analysis. Mol Biol Evol. 25:447-453.) discuss two ways of summarizing the posterior probability distribution of a Bayesian phylogenetic analysis, which they refer to as "topology-Bayes" and "clade-Bayes." They claim that the clade-Bayes approach leads to problems such as "exaggerated clade support, inconsistently...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Systematic biology
دوره 60 4 شماره
صفحات -
تاریخ انتشار 2011